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code reader (30) operative in conjunction with the en- 
coded sheet of material for reading the code; a sheet 
processing apparatus (1 1 0) for reading information from 
and/or writing information to at least one of the first and 
second surfaces, and a processor in communication 
with the code reader device and the sheet processing 
apparatus for associating the information with the read 
code. 
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(57) An encoded sheet material includes a sheet of 
material (1 0) having a first surface, a second surface 
disposed opposite the first surface and an edge (11,13) 
extending between the first surface and the second sur- 
face and peripherally about the sheet of material, the 
edge having indicia (12) arranged thereon to form a 
code uniquely identifying the sheet of material. A system 
for managing an encoded sheet of material, includes a 
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Description 

[0001] This invention relates generally to encoded 
sheet material, and more particularly to systems and 
methods for managing encoded sheet material having 
information recorded thereon. 

[0002] Despite the publicity about the paperless of- 
fice, paper remains an important media in today's work- 
ing environment. Many efforts have been made to inte- 
grate paper documents with computer-based informa- 
tion systems. These efforts generally involve two sce- 
narios. The first scenario involves scanning an existing 
physical document to create a digital copy, assigning a 
digital file name and then managing the digital copy as 
any other digital file. The second scenario involves cre- 
ation of a physical documentf rom an existing digital doc- 
ument orfile such as by printing. To aid in the Integration 
process, a barcode or a Dataglyph may be printed or 
otherwise attached to a physical document. Dataglyphs 
are generally less visually disruptive than barcodes. 
Both barcodes and Dataglyphs provide a means for the 
computer to grasp intentionally printed information on 
the paper document. Since both are generally applied 
at the time the information is recorded on the sheet of 
paper (but may be applied later through the use of an 
adhesive label), both generally appear on the same face 
of the sheet of paper as the recorded information. 
[0003] At the organization level, many documents, 
such as contracts, reports, files, technical documenta- 
tion, etc., have to be physically stored for legal, admin- 
istrative or operational reasons. Attention must be paid 
to their indexing and classification in order to keep doc- 
ument retrieval cost at an acceptable level. One solution 
is to physically attach a barcode or Dataglyph to an ex- 
isting paper document and scan the document into the 
computer-based information system. 
[0004] Having a record of a paper document in a com- 
puter-based information system does not solve the 
problem of knowing where the actual paper document 
is located. Even if location information is stored at the 
time the paper document is input into the system (such 
as when the barcode or Dataglyph is read by the com- 
puter-based information system), the paper document 
may have been moved when a user retrieved it and later 
replaced It. 

[0005] Most paper documents in offices do not con- 
tain barcodes or Dataglyphs and are not part of a com- 
puter-based information system. Most offices have piles 
of document on shelves, desks and the like that are clas- 
sified in an ad-hoc and personal manner. Finding a doc- 
ument in one of these ad hoc filing systems often means 
browsing through several piles to find a particular doc- 
ument. 

[0006] The verification that a paper or hard copy ver- 
sion of a document is an original, as opposed to a copy 
or imitation, is important in many businesses and legal 
transactions. Examples include contracts, stock certifi- 
cates, bank notes, premium bonds, etc. Verification of 



authenticity of an original document has become harder 
to perform since modern techniques enable the produc- 
tion of high quality copies which are frequently difficult 
to detect from the original. Even if an original paper doc- 
$ ument included a barcode or Dataglyph on the face of 
the document, any copy of the paper document will also 
include the same barcode or Dataglyph. 
[0007] Much work has been done in order to verify the 
integrity of the content of a hardcopy document as well 
™ as its origin (this is referred to as authenticating a doc- 
ument). Verifying the content and origin (authenticating) 
of a hard-copy document consists of making sure that 
its content was not tampered with, and that it really orig- 
inates from the supposed source. This prevents manip- 
13 ulation of the content of a document, while true (i.e. not 
manipulated) copies remain possible. Originality check 
is concerned with discriminating copies from the original 
hardcopy document. The problem of originality is closely 
related to the problem of authentication because in most 
20 cases where originality is important, the content is also 
important. However, in some cases the originality of a 
sheet of paper rtseff is important, independently of its 
content. Consider for instance the case of asheet of pa- 
per being circulated to collect signatures for a petition. 
25 When it comes back, the originator expects to have the 
original sheet (and not a possibly manipulated copy). 
[0008] The use of edge marking of sheet materials 
has been proposed for various applications. US Patent 
5,085,417 to Copham, Method of Encoding Stacks of 
so Printed Materials, describes a process for using edge 
markings to identify one customer's order for form 
checks from another customer's. 
[0009] EP-A-1 01 7017 describes pre-marking of edg- 
es of paper reams/stacks (during manufacture) with in- 
35 formation related to the paper's physical properties (e. 
g. its weight, color). 

[0010] An encoded sheet material, according to the 
invention, includes a sheet of material having a first sur- 
face, a second surface disposed opposite the first sur- 
40 face and an edge extending between the first surface 
and the second surface and peripherally about the sheet 
of material, the edge having indicia arranged thereon to 
form a code uniquely identifying the sheet of material. 
A system for managing an encoded sheet of material 
45 includes a code reader operative in conjunction with an 
encoded sheet of material for reading an edge code; 
wherein the encoded sheet of material has a first sur- 
face, a second surface disposed opposite the first sur- 
face and an edge extending between the first surface 
50 and the second surface and peripherally about the sheet 
of material, the edge haying indicia arranged thereon to 
form a code uniquely identifying the sheet of material; a 
sheet processing apparatus for reading information 
from and/or writing information to at least one of the first 
55 and second surfaces of the encoded sheet material; and 
a processor in communication with the code reader and 
the sheet processing apparatus for associating the in- 
formation with the edge code. 
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[0011] By providing each sheet of material with a 
unique edge identifier, any information that may be re- 
corded on the sheet of material may be associated with 
that sheet of material. By placing the unique identifier 
on the edge, both surfaces are available for recording 5 
information. The edge marking can be made with a vis- 
ible or an invisible ink. If the recording device includes 
an edge reader coupled to a processor with a memory, 
whenever a user makes a copy of an electronic file, the 
recording device reads the edge marking on each sheet 10 
of material used, and the processor associates that 
sheet of material with the electronic file. This association 
can be stored in memory. This feature is useful for track- 
ing or monitoring physical copies of an electronic file. 
Additional information or meta data may also be asso- *5 
ciated with the electronic file. 

[0012] The association information may be stored and 
used for other purposes, such as monitoring the number 
of copies made of a particular file, for monitoring the lo- 
cation of the copies and for monitoring the number of 20 
sheets of material used. The association information 
can be made or updated at any time. For example, if an 
electronic file is printed on a sheet of material with a 
unique edge marking, that association may be made 
and stored in a memory at the time of printing or later. -25 
If that recorded sheet of material is used to make a pho- 
tocopy, an edge reader in the copier can make an asso- 
ciation of the read edge marking of the "original hard 
copy - with the edge marking of the sheet of material 
used to make the photocopy. This information may be 30 
stored in memory and can be used to update the asso- 
ciation information with the original electronic file and 
create a new association for the "original hard copy." 
[001 3] Retrieving information associated with a sheet 
is accomplished by reading its edge identifier and que- 35 
ryingthe infrastructure to retrieve this information, given 
the identifier. Preferably, sheets of material are pre- 
marked at production time. If pre-marked at production 
time, each sheet can be given a code identifying the 
ream to which it belongs as well as uniquely identifying *o 
that sheet. The code can include a portion identifying 
the ream, manufacturer, and other information that a us- 
er might require. Some reams of sheet material may be 
specially coded with special visible and/or invisible inks 
and used as special bond paper for financial instru- 45 
ments, for example. Indeed, some organizations may 
wish to reserve special reams of material: 
[0014] Edge-readers can either be embedded in the 
recording devices (such as printers, facsimile machines, 
photocopiers, shredders, etc.) or affixed in work places 50 
(e.g. desktops). The edge readers are coupled to a com- 
puter or network where the read association information 
may be read and/or written. The edge readers enable 
the automatic association of printed-sheet <-> docu- 
ment Users may also use any sheet of a document ei- 55 
ther to obtain related service by passing the sheet 
through an edge-reader, or to establish an association 
in a similar way. 



[0015] Documents in paper form are largely used in 
almost all businesses. Documents are frequently stored 
in an ad-hoc manner (e.g., on personal shelves in an 
office) or formally (e.g., iegalor contractual documents 
are stored in filing cabinets, etc; in banks, administration 
offices, etc.). Document classification and retrieval is of- 
ten problematic and costly. The method of the invention 
provides computer support to these two tasks, without 
disrupting the user's normal work practice, and requires 
less effort. The invention enables documents to be au- 
tomatically indexed in place (i.e., in their normal storage 
area) and retrieved by obtaining storage information 
from a computer-based system. 
[0016] The encoded sheet material of the invention 
may be used to index and manage physical documents. 
Indexing and managing physical documents generally 
involves four steps. First, the physical documents must 
be indexed, which often includes generating some form 
of unique identifier. Then the physical documents must 
be inserted in the storage area, i.e., in accordance with 
some classification scheme. Retrieving the stored phys- 
ical document involves searching through the storage 
area using the classification scheme. Finally* after use, 
the document must be re-inserted into the storage area, 
which location may be different from the original loca- 
tion. 

[0017] Physical documents may be edge marked and 
the edge markings used to index and retrieve them from 
their storage locations. A method of managing a physi- 
cal document, wherein the physical document includes 
at least one sheet of material, includes marking an edge 
of the at least one sheet of material with indicia arranged 
to form a code identifying the sheet of material, storing 
the physical document in a storage location, indexing 
the physical document by associating the code with the 
storage location, and storing the indexing information in 
a computer-based information retrieval system. A meth- 
od of retrieving a physical document stored in a storage 
location, includes obtaining a code associated with the 
physical document, wherein the physical document in- 
cludes at least one sheet of material having a first sur- 
face, a second surface disposed opposite the first sur- 
face and an edge extending between the first surface 
and the second surface and peripherally about the sheet 
of material, the edge having indicia arranged thereon to 
form the code identifying the sheet of material, and in- 
formation recorded on at least one of the first and sec- 
ond surfaces of at least one of the sheets of material, 
using a scanning device to scan the storage location for 
the code, and when the output of the scanning device 
indicates the location of the sheet of material having 
code, retrieving the physical document. 
[0018] The invention provides a computer-imple- 
mented system for the indexing, storage and retrieval of 
paper document from piles, filing cabinet, shelves and 
more generally from document storage areas where 
document edges are visible. By using a robust code, 
such as a large barcode on the edge of the documents, 
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which is visible (to the particular scanning device) on the 
edge of the documents, physical documents may be in- 
put into a computer-based system and located for re- 
trieval. In addition to storing the edge codes and location 
information, images of the storage area (shelves, piles, s 
etc.) may also be stored in the computer-based system. 
Documents can be located by scanning the storage lo- 
cation for the document's edge code or by accessing 
the computer-based system and retrieving the storage 
location. : 10 

[001 9] Since each sheet of paper holds a unique iden- 
tifier (preferably) pre-marked on its edge at production 
time, indexing is automatic; the pre-marked edge codes 
constitute the minimal required index. However, this 
does not preclude the use of other forms of indexing, in 15 
addition. Since the physical documents can be easily 
retrieved (by scanning stacks of files for their edge 
codes), storage of physical documents becomes as sim- 
ple as stacking documents as they arrive. However, any 
other classification scheme can also be used. Docu- 20 
ments must be stacked so that the edge codes can be 
read. Physical document retrieval is computer or net- 
work supported; the computer or network indicates via 
an output device where the document is located. For ex- 
ample, the computer may display an image of the stor- 2s 
age area where the document is located. Reinsertion of 
a physical document is equally simple. 
[0020] An original document is one from which a copy, 
reproduction or translation is made. In the case of a con- 
tract, the original contract is the one (or ones in the case 30 
of duplicate originals) with the original signatures affixed 
to it. Originality goes to a document's content as well as 
physical integrity (i.e., the particular sheets of paper 
used). Authenticity of a document goes to the integrity 
of the information, i.e., whether the information con- 35 
forms to the information in the original. An authorized 
copy of an original document, is authentic if it conforms 
to the original so as to reproduce essential features. The 
invention enables the authentication of originals as well 
as copies. The invention provides a method of authen- 40 
ticating the sheets of paper (material) on which docu- 
ments are recorded as well as authenticating the infor- 
mation on the sheets of material. 
[0021 ] A method of creating an aut henticatable sheet 
of material, according to the invention, includes meas- 45 
uring at least one physical property of the sheet of ma- 
terial; marking an edge of the sheet of material with in- 
dicia arranged to form a unique code identifying the 
sheet of material, wherein the sheet of material includes 
a first surface, a second surface disposed opposite the so 
first surface and an edge extending between the first 
surface and the second surface and peripherally about 
the sheet of material; and recording the measured phys- 
ical property in a measure database indexed by the 
edge code. To verify the validity or authenticity of a sheet ss 
of material, the edge code is read, the same physical 
property is measured and the measured value is com- 
pared with the previously stored value extracted from 



the measure database. If the two are substantially 
equal, the sheet of material is authentic. 
[0022] The use of edge codes and physical property 
measures can be used to authenticate a document, i.e., 
a sheet of material on which information has been re- 
corded, and verify its originality. A method of creating 
an authenticatable physical document which includes 
information recorded on a surface of the sheet of paper 
includes using the edge code with the information to 
generate an encryption hash, digitally signing the en- 
cryption hash and recording the digitally signed encryp- 
tion hash on a surface of the sheet of material. If a phys- 
ical property has been measured and indexed in a 
measure database, the originality of the sheet of mate- 
rial can be verified as well. A digital signature does not 
hide the content of the information, but is used primarily 
to guarantee the identification of the sender of the infor- 
mation and Its integrity. For added security, the encryp- 
tion hash may be encrypted rather than just digitally 
signed. 

[0023] Verifying the authenticity of a sheet of material 
involves reading the digitally signed hash on the docu- 
ment, decrypting it generating a new hash from the 
edge code and a portion of the information. The two en- 
cryption hashes are then compared. If they are equal, 
the document is authentic. Additionally, the edge code 
can be used to determine if the sheet of material is orig- 
inal, by measuring the physical property and comparing 
its value to the value stored in the measure database. 

Brief Description of the Drawings 

[0024] 

Figure 1 is a schematic of an example edge marking 
according to the invention; 

Figure 2 is a diagram of a scheme for an edge mark- 
ing including a ream marking and individual sheet 
marking; 

Figure 3 is a perspective diagram of an edge reader 
reading an edge-marked sheet of material; 
Figure 4 is a schematic of an apparatus for manag- 
ing a sheet of material according to the invention; 
Figure 5 is a flow chart of a method of indexing and 
retrieving physical documents according to the in- 
vention; 

Figure 6 is an example of 12 characters in Code 39 
barcode; 

Figure 7 is an example of the code "E-PLACARD" 
in Code 39 barcode; 

Figure 8 is a photograph of a 30-page document 
with the code of Figure 7; 

Figure 9 is a photograph of the barcode extracted 
from Figure 8 and enlarged vertically; 
Figure 10 is a photograph of a screen showing the 
results of a standard barcode reader reading the 
barcode in Figure 8; 

Figure 1 1 is a photograph of an image of the location 
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of the F-Pi AHARD document: 
Figure 12 is ft schematic of a method of creating an 
authentic ohysical document. 
Figure 13 is a schematic of a method of authenti- 
cating a pnyscal document: and 5 
Figure 14 is a schematic el a method of authenti- 
cating a sheet o' material 

[0025] Most sheets of material such as paper sheets 
have six faces, two of them being commonly used; the 10 
so-called recto (first surface) and verso (second sur- 
face). The four other faces, the edges, may be used to 
uniquely identify each sheet and, optionally, the ream it 
comes, manufacturer and so on. If a ream identifier is 
used as part ol the unique sheet code, each sheet in the is 
ream will receive the same unique ream identifier, in ad- 
dition to a unique sheet identifier. These identifiers can 
be>marked with visible or invisible ink. 
[0026] Typical A4 paper sold today presents on the 
long and short edges a combined edge surface of 297 20 
x 0.1 mm 2 and 210 x 0,1 mm 2 , respectively. A robust 
code can be easily devised to fit within this size con- 
straint. (A robust code is one which can be easily read 
on the edge of a single sheet of material and also affords 
a sufficiently large number of codes for the anticipated 25 
amount of sheets of material to be used.) For example, 
a 64 bit long code made of 1 mm wide bars on a 2 mm 
grid would require 128 mm, which leaves enough room 
(say 64 mm for 32 bits of error code) for error correction 
code. In total this code (with error correction) would re- 30 
quire 192 mm. and would also fit on letter size sheets 
(8 1 v£ inches by 11 inches) . A typical edge thickness of 
0.1 mm is sufficient height to write the bar constituting 
the code. For instance, to give an order of magnitude, 
the dots of a 600dpi printer are 0.04 mm high. Two of 35 
these dots fit on the edge height of 0.1 mm. Thus the 
edge surface of typical paper sheets. is thick and long 
enough to hold the example robust codes. A 64 bit long 
code provides enough unique codes to enable each in- 
habitant of the earth to consume 80,000 sheets of paper *o 
(material) per day for over 100 years. 
[0027] There are many schemes which may be used 
to create a unique code to mark the edges of sheets of 
material. In addition to a unique sheet identifier, the edge 
code may also include information identifying the man- 
ufacturer, date of manufacture, a ream identifier, etc. 
Two specific examples will be described herein , howev- 
er, the invention is not limited to only these two. The ex- 
emplary coding schemes described below include a 
unique sheet identifier and a ream identifier. so 
[0028] The first scheme uses the long edge for the 
sheet identifier and the short edge for the ream identifier. 
Referring to Figure 1 , a sheet of material, such as paper, 
1 0 includes long edge 13 and short edge 11 . Long edge 
13 includes ream identifier 14; short edge 11 includes ss 
unique sheet identifier 12. While both identifiers are 
shown in Figure 1 as bars, other forms of markings may 
be used, such as diagonal lines, dots and the like. These 



marks may be visible or invisible. Optional recorded in- 
formation 1 6 may be recorded on surface 1 5 of sheet 1 0. 
[0029] The second exemplary coding scheme codes 
the ream and sheet identifiers on the same edge, thus 
allowing pre-marking of the two identifiers on all four 
edges, which may be advantageous for some applica- 
tions. Referring to Rgure 2, ream 20 includes a plurality 
of individual sheets of material 10 (typically there are 
500 sheets in a ream). In this coding scheme, the ream 
identifier and sheet identifier are on one edge. A ream 
identifier 22 is pre-marked at one section of the com- 
bined edges. In this embodiment, the ream identifier 
consists of bars. The sheet identifier is created by offset 
line 24, drawn across the entire ream. This marking 
scheme enables pre-marking of the entire ream at the 
same time. The offset 25 between the offset indicator 
mark 27 and the end of the paper edge 29 is sufficient 
to discriminate each sheet of a ream. 
[0030] In this example, the value of unique identifier 
for sheet M of a ream of N sheets of material may be 
computed as the distance 23 or as the ratio of the dis- 
tance 23 over the entire distance 23+25. This value in 
conjunction with the ream portion 22 provides a unique 
identifier for sheet M. Ream identifier 22 may also in- 
clude manufacturer name, date of manufacture, type of 
sheet material, etc. 

[0031 ] For a typical 20 pound ream of photocopy pa- 
per (height of 5cm, sheet thickness of 0 .1 mm, and 20cm 
of the ream edge dedicated to the offset line), the offset 
of two consecutive sheets is 0.4mm. In addition, drawing 
thicker lines parallel to the offset indicator line 24 will 
assist the edge reader device evaluate each sheet offset 
(because of the regularly distributed small plots on the 
edge of each sheet). Other alternatives exist, such as 
any asymmetric curvilinear function drawn in place of 
straight line 24 (provided the curve provides a unique 
code for each sheet). 

[0032] By marking sheets of material at production 
time with industrial means, invisible inks, fluorescent 
dyes or other technical inks may be used (in addition to 
standard visible inks or a combination of visible and in- 
visible inks). While it is anticipated that most sheets of 
material will be pre-marked : i.e., during manufacture 
and before an information is recorded on the surface, 
pre-marking is not required. Edges may be marked with 
a code after a sheet of material has been printed or re- 
corded with information. Edge marking after information 
is recorded is usefuf for existing physical documents, 
such as reports, articles, magazines, books, etc. 
[0033] An edge reader is used to read the edge code 
on a sheet of material or physical document. Edge read- 
ers may be employed on desktops as individual units or 
embedded in devices such as facsimile machines, print- 
ers, copiers, shredders, etc. An example of an edge 
reader is shown in Figure 3. Other edge readers may 
be used, such as a video camera or light pen. Referring 
to Figure 3, a sheet of material 10 with a marked edge 
1 2 passes through edge reader 30. The edge reader 30 
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reads the edge code, generates an edge code read sig- 
nal and sends it to computer 110 or network 100. Com- 
puter 110 or network 100 associates the edge code is 
associated with other information about the physical 
document 10, such as whatever information is recorded 
on a surface of sheet 10. As described below, the edge 
code can also be associated with storage location infor- 
mation of the physical document. 
[0034] Manual readers may be installed in any place 
where they allow users to conveniently read the edge 
identifier of a sheet of material. Typically, such a reader 
is affixed on the user's desk and connected to a desktop 
computer. An edge reader may include digital circuitry 
coupled with a bar sensor in front of which the sheets 
are translated (either manually by the user, or automat- 
ically when embedded in another device) . The bar sen- 
sor may be an LED and sensor couple, without any me- 
chanical parts. The digital circuitry decodes the edge- 
printed identifier* possibly in reverse direction because 
of possible rotations of the sheet of paper. 
[0035] Referring to Figure 4, a system for managing 
a sheet of material is shown. Sheet of material 1 0 is pre- 
marked during manufacture with a unique code on an 
edge. Sheet 10 is provided to edge reader 30 which 
reads its code and provides it to processor 50. Sheet 10 
may also be provided to sheet processing apparatus 40 
which may be a printer, which will print information from 
a digital file onto a surface of sheet 10 or a scanning 
device which will read recorded information from a sur- 
face of sheet 10 and create a digital file from it. Edge 
reader 30 and recording device 40 may be separate de- 
vices or edge reader may be embedded in recording de- 
vice 42. Recording device 40 reads or writes recorded 
information to processor 50. Processor 50 associates 
the read edge code with the recorded information and 
stores the association in memory 52. Alternatively, proc- 
essor 50 may store the association information on a net- 
work 60. Each time the sheet 10 is processed by edge 
reader 30 and/or recording device 40, the association 
information may be updated. 

[0036] In addition to creating associations between 
sheets of material and digital files, associations may be 
retrieved using the method and system of the invention. 
If a physical document includes at least one sheet of 
material with an edge code (and presumably recorded 
information on at least one of the sheets), the sheet of 
material may be read by an edge reader coupled to an 
information retrieval system. If the read edge code has 
been previously associated with some information, that 
information will be retrieved by sending the sheet of pa- 
per through an edge reader and searching for any stored 
associations with that edge identifier. 
[0037] Various infrastructures may be used to associ- 
ate a sheet of material with an edge code identifier with 
some information, and to retrieve the latter given an 
identifier. In particular the infrastructure disclosed in co- 
pending, co-assigned U.S. patent applications number 
09/276.085 filed March 25, 1 999,"Marking Medium Area 



with Encoded Identifier for Producing Action through 
Network" and U.S. application number 09/276,532 filed 
March 25, 1999, "Obtaining Network Addresses from 
Identifiers" may be used. These applications describe 
5 how a coded number is resolved to the Internet address 
of a document (or action) through indirection via a cen- 
tralized router and devises efficient routing schemes 
(which allow codes of the order of 64 bits). Because of 
the indirection level provided by the router, paper can 
10 be sold with the code pre-marked on their edges; asso- 
ciation with an existing reference may be done a poste- 
riori through a software process of linking the code, to 
the document address in the routing tables. 
[0038] Because of the important role of paper in work- 
's places, the ability to easily associate any piece of paper 
with electronic information is an essential feature for cre- 
ating and developing knowledge sharing solutions. For 
example pre-marking of unique identifiers on paper 
sheets makes it possible to turn any page of a user's 
20 documents into a document token. There is no need to 
print additional machine-readable code, since the 
sheets are pre-marked, but only to associate the pre- 
marked identifiers with the electronic document, or with 
whatever is needed for further usage as document to- 
25 ken. 

[0039] For example, if sheet processing devices 
which output printed paper (printer, fax, copier) are 
equipped with an edge reader and connected to the in- 
frastructure of the above described co-assigned appli- 
30 cations, it is possible to (associate every printed sheet 
with application -relevant information. For example, a 
printer associates the produced document with some 
electronic information, by associating (the range of) the 
identifiers of the constituting sheets with it. Optionally, 
35 only the first and last sheets are pre-marked sheets in 
order to reduce the visual "gray" effect on the edge of 
printed document (two different paper stacks may be 
used, one with pre-marked paper and the other with nor- 
mal paper). The copier associates the identifier of the 
40 original paper to its copy's identifier, or possibly resolves 
the former before establish ing the association. A facsim- 
ile machine works similar to the printer. In turn, the print- 
ed document is known to the system and any of its 
sheets can act as a document token once passed in an 
45 edge reader. 

[0040] Thus a reprint may be obtained by reading the 
edge of a document sheet at a "reprint" edge reader af- 
fixed close to a printer or copier. The electronic version 
of a document may be opened on a computer by passing 
50 the printed version in the desktop edge reader. 

[0041 ] Uniquely identified sheets may be used in con- 
junction with the system described in application 
number 09/276,085 dealing with the recognition of the 
document the person is using. By combining the two, 
55 "Intelligent Papers" may be easily produced. Either the 
publisher prints the document on an edge-reader- 
equipped printer to establish the association between 
the paper document and its electronic counterpart, or it 
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associates the document with the pre-printed ream 
identifier(s). The latter applies particularly well for large 
volume as it avoids reading each sheet identifier when 
printing and as it reduces the infrastructure load for as- 
sociation and resolution. The device used by the user 
must be equipped with an edge reader in order to iden- 
tify the electronic counterpart of the document (by que- 
rying the normal Intelligent Paper infrastructure). 
[0042] Visible edge codes can be used to index, store 
and retrieve physical documents. A visible code may be 
one that is visible to the eye, and it may also be one that 
is invisible to the eye and visible to a detector, such as 
an infrared detector. A flow chart of the various steps in 
the method is shown in Figure 5. The first step is to mark 
the edge of at least one sheet of a physical document 
(step 70). Preferably this is accomplished during man- 
ufacture of the sheet of material (such as paper), but it 
may also occur after manufacture. For multi-page doc- 
uments, one or some or all of the individual sheets may 
be. edge marked with an identifying code. The code is 
associated with the physical document, such as by doc- 
ument meta data (e.g., URL, title, author, type, topic, 
date, file name, or some other convenient reference that 
may be user selected). Associating the code with the 
physical document ties the content of the document to 
the sheets of material. 

[0043] In step 72 the physical document is stored in 
the desired physical storage location (e.g., pile of doc- 
uments on a desk, file drawer, room, etc.). In step 74 the 
code is associated with the actual location in storage of 
the physical document. In step 76, the location associ- 
ation information is stored in a memory. The association 
information includes document code and location infor- 
mation. It may also include the meta data previousiyLast _ 
sociated with the code. An image of the storage location, 
where the physical document is located may also be as- 
sociated with the code. 

[0044] If a document is already in a storage location 
and it has an edge code, but it has not been indexed 
into the system, it can be indexed by scanning the stor- 
age location for the document's code (step 78). When 
the code is located (step 80), the storage location is as- 
sociated with the code (step 74) and that information is 
stored in memory (step 76). 

[0045] Retrieving a document indexed in the system 
is accomplished by searching the system's memory for 
the document's code (step 82), reading the location in- 
formation associated with the code (step 84) and then 
retrieving the document from the storage location (step 
86). 

[0046] There are several ways to make visible a code 
on the document edge and to establish the association 
between the edge-visible code and a document. Prefer- 
ably, the edge of paper sheets is pre-marked at produc- 
tion time with a code uniquely identifying each sheet and 
each ream of paper, a software infrastructure that per- 
mits associating the edge code with some data relevant 
to the document (usually the identity - such as the URL 



of the document - but possibly also with meta data) is 
provided and edge-reader are provided both embedded 
in devices like printers and copiers and provided to us- 
ers as a desktop tool. 

5 [0047] The edge marking scheme shown in Figure 2 
may be used. Since most documents will contain sev- 
eral pages, we are interested in the visual effect of stack- 
ing the sheets of a document, where the ream identifiers 
and offset indicators become aggregated. It is reason - 

10 able to assume that in most cases, most of the sheets 
of a document come from the same ream. Because sev- 
eral sheets of the printed documenthave the same ream: 
identifier, this identifier will become clearly visible on the 
document edge. In a similar manner, the document edge 

*5 will exhibit a portion of the offset indicator line, as shown 
in Figure 2. 

[0048] The combination of the aggregated ream Iden- 
tifier and offset tine portion uniquely identify the docu- 
ment. Note that these marks may be invisible to the hu- 

2° man eye because they are marked with invisible ink, or 
semi-visible. In some situations it may not be necessary 
to read the entire code on each sheet of paper. A sinv 
plification of the method may be made by using only the 
ream identifier. The retrieval service may be slightly de- , 

25 graded: the computer system may indicate several lo- 
cations for the requested document, corresponding to 
the several documents printed from the same ream. An 
advantage of this simplified version resides in easier Im- 
age processing, in particular if only the ream identifier 

30 is written on the short edge of sheet and is therefore 
very large. This simplified version may fit well in office 
settings, where several users share a printer and there 
is little chance the same user will obtain several docu- 

ments_from_tbe_same_ream.^ 

35 [0049] In addition to the edge marking described 
above, other methods of marking edges of sheets of pa- 
per may also be used in the indexing, storage and re- 
trieval method. 

[0050] Once a code is provided on the edge of at least 

^0 one sheet of the document, the code may be associated 
with the docurnent or the document meta data in one of 
several ways. Preferably, the association is established 
at print time in an automatic manner as described 
above. The printer is equipped with an edge reader, 

45 which decodes the unique sheet identifier, and estab- 
lishes the association between the document and the 
sheet identifier. For the association, the infrastructure 
described in co-assigned patent applications 
09/276,085 and 09/276,532 may be used. Very little 

50 modification is required to implement this system; the 
printermust have an edge reader embedded or coupled 
to it. Paper surfaces are free from any mark and can be 
imprinted with any content. Alternatively, the user may 
explicitly establish the association once the document 

55 is printed, by presenting it to a sensor (edge reader, 
camera, scanner). 

[0051] Location information can be stored as an im- 
age with the document association information. A cam- 
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era can be used »o obtain an image of the document in 
the storage area The image can be grabbed either on 
demand (when a search occurs) or periodically to main- 
tain an up-to-date index ol location of documents. The 
latter also permits reporting on document availability s 
and in some way backing document usage. An invento- 
ry of stored documents is also available to the user. 
[0052] Because of resolution issues, one image may 
not be adequate to identify and read edge codes. Ob- 
taining a higher-resolution image of the storage area, in 10 
particular for large areas, is possible, for example, by 
overlapping snapshots If overlapping snapshots is in- 
sufficient to locate a document, image mosaicking may 
be used, but is computationally more costly (in order to 
stitch togetherslightfy high-resolution overlapping snap- is 
shots). Once the computer has determined the location 
of a searched document, the location may be commu- 
nicated to the user by displaying n image of the storage 
area, for instance, with the exact location of the docu- 
ment highlighted in the image. This provides a natural 20 
and intuitive way of communicating location to users. 
[0053] In addition to cameras, a laser detector cou- 
pled with a laser beam may be used to point out the doc- 
ument to the user. A more futuristic approach would be 
for the user to wear a computer equipped with a camera, 
and to have a glass-mounted screen. Augmented reality 
techniques would then allow the user to directly see 
where the document is within the storage area. 
[0054] The identification of regions holding a code 
and its decoding requires image-processing tech- 
niques. In order for these techniques" to work, the code 
must have sufficient resolution. While many different 
codes may be used and have the required resolution 
depending on the image processing equipment used, 
we have demonstrated the feasibility of a widely used 
barcode named Code 39. Encoding codes of the order 
of 2 s4 bits requires 12 characters, given the alphabet of 
42 symbols of the Code 39. Each character encoded in 
a Code 39 symbol is made up of 5 bars and 4 spaces 
for a total of 9 elements. Each bar or space is either 
"wide" or "narrow" and 3 out of the 9 elements in any 
given character are wide, giving the code its other name 
- Code 3 of 9. Consider narrow bars of 1.25mm and 
wide bars of 2.5mm. The 12-character code has a total 
length of 1 96mm, and fits on the short edge of a sheet 
(see Figure 6). For the long edge, much thinner bars can 
be used to preserve room for the offset indicator line. 
For example, using more classical 0.25 mm width bars 
makes the code length back to 4cm. 
[0055] Consider now that a camera with a 1600 x 
1200 pixels resolution grabs one picture of an area of 1 
meter x 0.75 meters]. Each pixel represents 0.625 mil- 
limeters x 0.62 millimeters. Two pixels cover the width 
of one narrow bar. Four pixels cover the width of a wide 
bar. This fits our needs, while not taking into account 
higher resolution obtained via image mosaicking. There 
may be a problem for documents that do not exhibit a 
clear image on their edge, e.g. there is no block of con- 



tiguous pages coming from a single ream that is large 
enough to form an image. This can happen for a very 
small document or for a document spanning over sev- 
eral reams. The case of paper jam may also slightly trou- 
ble the image of the offset line, although probably not 
seriously. All of these problems can be handled in two 
complementary ways. First, the printer can detect these 
problems when they occur, because it reads the edge 
identifier of each sheet. Once a serious problem is de- 
tected, it can notify the user and print the document 
again if the problem was transient (paper jam, several 
reams). Small documents may possibly not work at all, 
unless additional blank pages (or some special separa- 
tors) are added to them. Second, the user can detect 
these problems simply by looking at the edge of the doc- 
ument (if ink is visible or semi-visible). 
[0056] To demonstrate the method of the invention, a 
30-page document edge marked on the bottom (short 
edge) of each page the barcode shown in Figure 7. The 
barcode was printed at the bottom of each page using 
Microsoft Word with a zero width margin on a Xerox 
DocuPrint 4517. The barcode is actually at the bottom 
of the page, bleeding over onto the edge. When the 
30-page document is stacked in with a group of un- 
marked documents, the stacked, the document edge 
appears as shown in Figure 8. Figure 8 is a photograph 
of the document taken with a Kodak digital camera 
DC50 at a resolution of 756x504 pixels. The picture cov- 
ers approximately 30cm x20 cm. Figure 9 is a photo- 
graph of the barcode extracted from Figure 8 and en- 
larged vertically. A standard barcode reader was able to 
read the encoded value: E- PLACARD (see Figure 10). 
When queried for the location of the document encoded 
E-PLACARD, the system returns the image shown in 
Figure 11 with stored association information. 
[0057] A method of creating an authenticatable sheet 
of material includes using the sheets of material with 
unique edge codes described above. For each uniquely 
identified sheet of material, such as paper, the measure 
of some physical property is taken, preferably at produc- 
tion time (the measure could be taken any time before 
the sheet of material is to be recorded with information). 
The measure is recorded in a measure database in- 
dexed by the edge code (sheet identifier) of the sheet 
of material. The proof of originality of an individual sheet 
of paper is established by measuring again the same 
physical property and by comparing the measure to the 
original one obtained from the measure database. The 
unique identifier (edge code) of each sheet of material 
permits retrieval of the original measure (taken at pro- 
duction time or some previous time) from the database). 
[0058] One or more physical properties may be meas- 
ured and the measured value stored in the measure da- 
tabase. For paper materials, paper fiber arrangement or 
ink penetration may be measured. The infrastructure for 
associating information with a sheet of material de- 
scribed above may be used to associate the measured 
physical properties with the sheet of material's edge 
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code. The physical property information may also be as- 
sociated with any physical document (in which informa- 
tion such as article title, file name, URL, etc. is also 
stored). For security reasons, the physical property in- 
formation may be password protected or otherwise se- 5 
curely protected. 

[0059] Having a measure database for storage of 
measured physical property may be preferred by some 
users. Others may wish to store the measured physical 
property values locally in their own database where oth- io 
er association information is stored with the edge code. 
Still other users may wish to have the measured physi- 
cal property information encrypted and recorded or 
marked on the edge. An edge reader when reading the 
encrypted edge code would have to send the read en- is 
crypted portion to a decryption device or ignore it rf phys- 
ical property data is not needed. 
[0060] In a typical hard copy authentication process, 
a hash-value is created on the basis of the document 
content, for example, by scanning it, extracting text and 20 
picture characteristics and compressing this information 
(with loss) into a hash-value. The hash-value is digitally 
signed (it may also be encrypted if the content is to be 
hidden) and printed on the document itself, for example, 
using a private cryptographic key to sign the hash-value 25 
and printing it as a Dataglyph or barcode on thebottom 
of the document. Authentication consists of taking the 
Dataglyph marked document, computing the hash-val- 
ue given the text of the document, reading the signed 
hash-value printed on the document, and validating it 30 
against the computed one using the public cryptograph- 
ic key. 

[0061] The invention can also be used to extend ex- 
isting hardcopy document authentication methods by in- 
serting the unique edge code of the particular sheet of 35 
material in the document's authentication stamp. Refer- 
ring to Figure 12, document 200 includes document text 
204 which is printed on the surface of a sheet of paper 
which has been edged marked with edge code 202. An 
encryption hash 206 is created using both edge code 40 
202 and a portion of text 204. The hash 206 is then dig- 
itally signed (or optionally encrypted) 210 using private 
key 208. The digitally signed (or encrypted) hash is then 
printed as a Dataglyph 212 on document 200. Prefera- 
bly the digitally signed (or encrypted) hash is printed as 45 
a Dataglyph, but any other recording method or format, 
such as a barcode, may be used. The Dataglyph may 
be recorded on any portion of the sheet of material: first 
or second surface or on an unused portion of the edge. 
[0062] By using edge coded sheets of material that 50 
have been additionally pre-notarized, i.e., the validity of 
the identifier of an individual sheet can be verified by 
reference to a measure database, retrieving the pre- 
measured physical property and comparing it to the 
measured value, pre-notarized paper prevents a forger 55 
from producing a sheet of paper with a duplicate edge 
code. A forger may still damage the edge code, render- 
ing authenticity and originality in question. Pre-notarized 



paper tackles the problem of originality, by ensuring the 
originality of the medium, i.e. a sheet of paper, given its 
unique edge code identifier. 

[0063] Referring to Figure 13; document 200 includes 
edge code 202, text 204 and Dataglyph 214 which holds 
a signed hash-value for the purpose of authentication 
and originality verif ication. Document 200 was created 
using pre-notarized paper, i.e., text 204 was printed or 
recorded on a sheet of pre-notarized paper. In this ex- 
ample, pre-notarized paper was produced as follows 
during manufacture of the sheet of paper. The unique 
sheet identifier 202 was applied to the paper's edge. 
Some measure of some chosen physical property(ies) 
of the sheet of paper was taken. This taken measure 
was stored in a measure database and indexed by the 
sheet unique identifier 202. 

[0064] In Figure 1 3, the process for authenticating the 
document 200, i.e., whether the text is authentic, Is 
shown. The edge code 202 is read in an edge reader 
and used with a portion of text 204 to create a new hash 
value 216. Dataglyph 214 is read and decrypted using 
public key 220 in decryption engine 222 to generate the 
original hash value. The original hash value is compared 
in comparator 224 with the new hash value 216. If the 
result 226 of the comparison indicates the two are sub- 
stantially equal (within the level of loss tolerated by the 
encryption algorithm), the content (text) of the document 
200 is authentic. 

[0065] The next step is to verify that the document is 
original, i.e., that the sheet of paper is the same one 
used when the document 200 was created. Referring to 
Figure 14, the edge code 202 is read by an edge reader. 
The edge code 202 is provided to a database access 
230 which consults measure database 232. Measure 
database retrieves the physical properties measured 
234 for that sheet of material indexed by the edge code 
202. Given the type of physical properties measured at 
production time, those same measurements are made 
again to produce physical measures 236. Measures 234 
and 236 are compared in comparator 238. If the result 
240 is substantially equal, the paper is original. 
[0066] This solution to the joint-problem of authenti- 
cation and originality of hardcopy orphysical documents 
incorporates the unique edge code identifier of the sheet 
in the authentication, content-based, hash-value. The 
verification consists of reading the edge identifier, com- 
puting again the hash-value given the identifier and the 
document content, and finally validating the computed 
hash-value against the printed hash-value using the 
cryptographic public key. This method prevents manip- 
ulation of the content and prevents copy of the docu- 
ment because the sheet identifier cannot be forged. 
[0067] The strength of this method relies on the diffi- 
culty of producing a sheet of paper having both a given 
identifier and some given physical properties. With re- 
spect to the physical measure, the edge itself may be 
used as the physical property or characteristic that dis- 
criminates each individual sheet. For example, the edge 
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mark (identifier) applied on the edge involves applying 
some type of ink or dye on the edge. The way the ink or 
dye penetrates the paper edge is presumably difficult to 
forge and can therefore be a valuable measure. Alter- 
natively, the state of the "surface" of the edge and its s 
geometry constitutes a unique profile. An advantage to 
using the physical characteristics of the edge comes 
from the imposed usage of an edge reader (to read the 
edge unique identifier) at various steps of the method. 
If some of these edge readers are also able to adequate- to 
ly measure the characteristics above, they will serve two 
purposes in one operation, both for the paper producer 
and for the consumer. 

[0068] There are many schemes for managing the 
measure database. The database may be managed by is 
the paper producer that provides a service allowing the 
consumers to check the validity of the sheet identifiers. 
The database may be managed by a notarial service run 
by a third party; The consumer may purchase the pre- 
notarized paper together with the record of the physical 20 
measures. However in this case, the notarization may 
probably be valid only internally to the customer organ- 
ization. 

[0069] The initial physical measures can be entirely 
recorded, for example, as a compressed very high-res- 25 
olution picture of the pre-marked edge. The comparison 
of the physical measures is then possibly made on the 
entire measure, i.e., on the entire edge and not just a 
ponton The level of comparison may be user defined. 
For example, the consumer may make measures that 30 
are less accurate than the ones taken at production time 
while still being able to compare the two. This has a pos- 
itive implication in term of cost for the consumer (cheap- 
er machinery, faster to operate). The invention sepa- 
rates the originality of the medium from the authentica- 35 
tion of the content. 

[0070] The digital signature can include the unique 
identifier and content-based information, regardless of 
the size and type of physical measures. This leads to 
either a smaller footprint of the digital signature or an 40 
increased accuracy for the content representation (e.g. 
a bigger hash-value of the text). Quick and cheap veri- 
fications of the originality can be made by reading the 
sheet unique identifier and checking it together with the 
document content against the signed hash-value (with- 45 
out taking any physical measure). The strength of the 
originality verification is directly dependent on the diffi- 
culty for a forger to mark a given identifier on the edge 
of a sheet of paper. However, if special ink may be used, 
it may require an uncommon infrastructure and know- so 
how. Similarly, a consumer might avoid verifying the 
originality of the sheet (avoid taking any physical meas- 
ure) when computing the hash-value for a document, to 
make it easier and cheaper. The paper producer may 
keep secret some of the measures taken as an addition- 55 
al protection against forgery of the paper. But some of 
the properties should be public, to allow the consumers 
to run their own verification with their own devices if they 



want to do so. The choice of the physical properties to 
measure can evolve overtime almost transparently for 
consumers, while increasing quality of paper pre-nota- 
rization. 



Claims 

1- An encoded sheet material, comprising: 

a sheet of material having a first surface, a 
second surface disposed opposite the first surface 
and an edge extending between the first surface 
and the second surface and peripherally about the 
sheet of material, the edge having indicia arranged 
thereon to form a code uniquely identifying the 
sheet of material. 

2. The encoded sheet material of claim 1 , wherein the 
indicia are pre-marked during fabrication of the 
sheet material. 

3. The encoded sheet material of claim 1 , wherein the 
indicia are marked on the edge after information Is 
recorded on one of the first and second surfaces. 

4. The encoded sheet material of claim 2, wherein the 
encoded sheet material is fabricated as part of a 
ream of sheet material and the code includes a first 
portion identifying the ream and a second portion 
ident|fy|ng the encoded sheet material. 

5. The encoded sheet material of claim 4, wherein the 
edge comprises four sides and wherein the first por- 
tion is located on one side and the second portion 
is located on one of the other sides. 

6. A system for managing an encoded sheet of mate- 
rial according to any of the preceding claims, the 
system comprising: 

a code reader operative in conjunction with an 
encoded sheet of material for reading an edge 
code; 

a sheet processing apparatus for reading infor- 
mation from and/or writing information to at 
least one of the first and second surfaces of the 
encoded sheet material; and 
a processor in communication with the code 
reader device and the sheet processing appa- 
ratus for associating the information with the 
edge code. 

7. The apparatus of claim 6, further comprising a 
memory for storing a record of the association be- 
tween the edge code and meta data pertaining to 
the information. 

8. The apparatus of claim 6 or claim 7, wherein the 
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sheet processing apparatus comprises a printer 
and the information comprises a digital file stored 
in a memory. 

9. The apparatus of claim 6 or claim 7, wherein the 
sheet processing apparatus comprises a scanner 
and the information comprises an image scanned 
from the surface. 

10. A method of retrieving a physical document stored 
in a storage location, comprising: 

obtaining a code associated with the physical 
document; 

wherein the physical document includes at 
least one sheet of material having a first sur- 
face, a second surface disposed opposite the 
first surface and an edge extending between 
the first surface and the second surface and pe- 
ripherally about the sheet of material, the edge 
having indicia arranged thereon to form the 
code identifying the sheet of material, and in- 
formation recorded on at least one of the first 
and second surfaces of at least one of the 
sheets of material; 

using a scanning device to scan the storage lo- 
cation for the code; and 

when the output of the scanning device indi- 
cates the location of the code, retrieving the 
physical document from the indicated location. 

11. The method of claim 10, further comprising: 

indexing the location of the physical document 
by associating the code with the storage loca- 
tion; and 

storing the indexing information in a computer- 
based information retrieval system. 

12. The method of claim 10 or claim 11, wherein the 
code comprises a unique code. 



tying the sheet of material, wherein the sheet 
of materia! includes a first surface, a second 
surface disposed opposite the first surface and 
an edge extending between the first surface 
5 and the second surface and peripherally about 

the sheet of material; and 
recording the measured physical property in a 
measure database indexed by the edge code. 

10 16. The method of claim 15, wherein the physical prop- 
erty comprises at least one of paper fiber arrange- 
ment, edge surface and geometry profile, and ink 
penetration on the edge. 

15 17. The method of claim 15 or claim 16, further com- 
prising: 

creating an encryption hash from the edge code 
and a portion of the information; 
20 digitally signing the encryption hash; and 

marking one of the first and second surfaces 
with the digitally signed hash. 
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13. The method of any of claims 10 to 12, wherein the 
physical document includes additional sheets of 
material. 



45 



14. The method of any of claims 10 to 13, wherein the 
obtaining step includes searching an index of in- 
dexed physical documents in an information retriev- 
al system. 



50 



15. A method of creating an authenticatable sheet of 
material, comprising: 

measuring at least one physical property of the 
sheet of material; 

marking an edge of the sheet of material with 
indicia arranged to form a unique code identi- 
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